Retrieval of Web Documents Using a Fuzzy Hierarchical Clustering

نویسندگان

  • Deepti Gupta
  • Nidhi Tyagi
  • Komal Kumar Bhatia
  • A. K. Sharma
  • Els Lefever
  • Timur Fayruzov
  • Veronique Hoste
  • Martine De Cock
  • Sadaaki Miyamoto
چکیده

The World Wide Web has huge amount of information that is retrieved using information retrieval tool like Search Engine. Page repository of Search Engine contains the web documents downloaded by the crawler. This repository contains variety of web documents from different domains. In this paper, a technique called “Retrieval of Web documents using a fuzzy hierarchical clustering” is being proposed that creates the clusters of web documents using fuzzy hierarchical clustering.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Web Document Clustering Using Fuzzy Equivalence Relations

Conventional clustering means classifying the given data objects as exclusive subsets (clusters).That means we can discriminate clearly whether an object belongs to a cluster or not. However such a partition is insufficient to represent many real situations. Therefore a fuzzy clustering method is offered to construct clusters with uncertain boundaries and allows that one object belongs to overl...

متن کامل

Fuzzy clustering for indexing in the GAMBAL information retrieval system

Gambal is an information retrieval system for indexing and accessing web pages that includes graphical interfaces to ease web page search and accessing. In particular, the interfaces provide the user with tools for navigating through hierarchies of documents and visualize selected documents and similar ones. Here, similarity is either based on Wordnet 1.7 or Latent Semantics Analysis. Graphical...

متن کامل

Hierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics

This paper discusses about the future of the World Wide Web development, called Semantic Web. Undoubtedly, Web service is one of the most important services on the Internet, which has had the greatest impact on the generalization of the Internet in human societies. Internet penetration has been an effective factor in growth of the volume of information on the Web. The massive growth of informat...

متن کامل

Hierarchical Clustering of documents-A brief study and implementation in MATLAB

The paper discusses and implements hierarchical clustering of documents. The objective is to group similar documents together using hierarchical clustering methods. The paper aims at organizing a set of documents into clusters. The paper is focused on Web Content mining by clustering web documents. Clustering is done on document corpus in MATLAB environment. The result is groups or clusters of ...

متن کامل

A Novel Indexing Technique for Web Documents using Hierarchical Clustering

The information on the WWW is growing at an exponential rate; therefore, search engines are required to index the downloaded Web documents more efficiently. Web mining techniques like clustering can be used for this purpose. In this paper, a novel technique to index the documents is being proposed that not only indexes the documents more efficiently but also uses hierarchical clustering to keep...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010